arXiv:1504.05350v2 [math.PR] 29 Feb 2016 


CONNECTIVITY THRESHOLD FOR RANDOM SUBGRAPHS OF 

THE HAMMING GRAPH 

LORENZO FEDERICO, REMCO VAN DER HOFSTAD, AND TIM HULSHOF 


Abstract. We study the connectivity of random subgraphs of the d-dimensional 
Hamming graph H{d, n), which is the Cartesian product of d complete graphs on 
n vertices. We sample the random subgraph with an i.i.d. Bernoulli bond per¬ 
colation on H (d, n) with parameter p. We identify the window of the transition: 
when np —log n —oo the probability that the graph is connected goes to 0, while 
when np — logn —>■ -|-oo it converges to 1. We also investigate the connectivity 
probability inside the critical window, namely when np — log n —>■ t £ R. We find 
that the threshold does not depend on d, unlike the phase transition of the giant 
connected component of the Hamming graph (see [1]). Within the critical window, 
the connectivity probability does depend on d. We determine how. 
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1. Introduction 

In this paper we investigate the random edge subgraph of d—dimensional Ham¬ 
ming graphs. Hamming graphs are dehned as follows: 

Definition 1.1 (Hamming graph). For integer n write [n] := {1,... ,n}. We define 
the d—dimensional Hamming graph H{d,n) as the graph with vertex set 

V = [n]^, 

and edge set 

E = {{v,w) : v,w & V, Vj 7 ^ wj for exactly one j}. 


We study a percolation model on the Hamming graph. We dehne the random 
subgraph Hx{d,n) as the random edge subgraph with uniform edge retention prob¬ 
ability p = Since the degree of every vertex in H{d,n) is d{n — 1), the 

parameter A thus indicates the expected number of outgoing edges from any given 
vertex. 

The phase transition for the existence of a giant component (i.e., when |Cma.x| ~ 
C|U| for C G (0,1)) was studied in [1, 8] for a larger class of hnite transitive graphs 
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that includes H{d,n), while the slightly supercritical behavior was analyzed in [6] 
and [7] for d = 2. 

In this work, we move away from the giant component critical point and we aim 
to determine the asymptotic probability that Hx{d,n) is connected for d fixed and 
n —)■ oo. The analogous problem was first solved for the Erdds-Renyi Random Graph 
(ERRG) in [3]. Observe that the ERRG arises as a special case of our problem if we 
put d = 1. We will follow the proof for the ERRG (see e.g. [5, Section 5.3]), but we 
hnd that at places the internal geometry of the Hamming graph plays an important 
role. To overcome this difficulty we use an induction on the dimension d and an 
exploration of the graph. 


2. Main Results 


Let Hn := Hx{d,n) be a sequence of random edge subgraphs of H{d,n) with 
parameter A = A(n). Given A we want to determine the asymptotic probability that 
Hn is connected. 

Theorem 2.1 (Gonnectivity threshold for Hx{d, n)). If limn^oo A — dlogn = f G M, 
then 


Fx{Hn is connected) —)■ e 


( 2 . 1 ) 


Consequently, 



( 2 . 2 ) 


These results show an interesting difference between the critical values of the gi¬ 
ant component threshold and the connectivity threshold. The critical probability 
of the former, pcc = (1 + o(l))) depends on d, while the latter, 

does not. This fact provides us with some insight into the structure of Hx{d,n) 
at the connectivity threshold: Gonsider the lower-dimensional “hyperplanes” (i.e., 
the subgraphs of H{d,n) induced by all vertices {vi,...,Vd) that satisfy a set of 
constraints of the form Vj = kj for some j G [d], kj G [n], see Dehnition 4.1 below). 
Note that these hyperplanes are isomorphic to Hamming graphs of lower dimension. 
From [1] we know that there exist values of A such that Hx{d, n) has a giant compo¬ 
nent while the intersections of Hx{d,n) with a hyperplane are subcritical (i.e., the 
largest components inside a hyperplane are of order O(logn)). But an analogous 
property does not hold for the connectivity threshold: if Hx{d,n) is connected with 
probability converging to 1, then the same holds for all its hyperplanar subgraphs. 

We believe this phenomenon holds in much greater generality than Hamming 
graphs: our proof of Theorem 2.1 can easily be modified to show that it also holds 
for the Gartesian product of d copies of the complete fc-partite graph, and we believe 
it to be true for a larger class of powers of high-degree transitive graphs. 
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2.1. Related literature. In [4] Erdos and Spencer studied the connectivity thresh¬ 
old of the hypercube H{d, 2), where they found that the connectivity threshold occurs 
around P = \ (also independent of d). Clark [2] studied the connectivity threshold 
of H{d, n) for n hxed and d —)■ cxo, showing that if 



and ^(d) a G (0, oo), then the probability that the percolated graph is con¬ 

nected converges to e““. Expansion of the above equation around n = oo shows that 
the d —?• oo limit for large values of n has the same behavior as the n ^ oo limit. 
Moreover, [10] shows that more generally, Cartesian products of hxed graphs have 
a connectivity threshold that only depends on their degree distribution as d —)■ oo. 

Sivakoff gives a statement analogous to our main theorem for site percolation 
in [11]. It should be noted that site and edge percolation are very different models 
on the Hamming graph, as can be seen for instance in the fact that connectivity 
of site percolation on is trivial, whereas connectivity of edge percolation on 
(i.e., the ERRG) is not. See also [12]. 

3. Poisson convergenge of isolated vertiges 

We start investigating the number of isolated vertices in the Hamming graph. As 
in the case of the ERRG, this provides a sharp lower bound on the window of the 
connectivity threshold. We dehne the number of isolated vertices 

^ •= 

iev 

where Gi is the connected component of vertex i. We prove that in the critical win¬ 
dow (i.e., when A —dlogn —)■ f G M) the random variable Y converges in distribution 
to a Poisson random variable. This proof is standard, and uses the same arguments 
applied to the proof given for the ERRG in [5, Section 5.3]. 

Let {x)n denote the nth lower factorial of x, i.e., (x)„ := x(x — l)(x — 2) ■ ■ ■ (x — 
n -|- 1). We will use the following lemmas, whose proofs are given in [5, Section 2.1] 
(for general versions see [9, Ghapter 6]): 

Lemma 3.1. A sequence of integer-valued random variables converges 

in distribution to a Poisson random variable with parameter /i when, for all r = 

1 , 2 ,..., 


lim E[(X0.] = /ih 

n—^oo 


(3.1) 
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Lemma 3.2. When X = Yliej 1* ® least r indicators then 

E[(X),] = P(li, = li^ = ... = li^ = 1), 

where the sum is over all sets of r distinct indices. 


Given Hn = (Ki, En), we want to prove that (3.1) holds for Yn := J2vi£Vn 
We will use Lemma 3.2 with an upper and lower bound on PA(lii = ■ • • = 1,^ = 1) 
where we take Ij to be the indicator function of the event that the vertex Vi is 
isolated. Observe that we have n'^\/{n^ — r)! different sets of distinct vertices of 
cardinality r. We call m := d{n — 1) the degree of H{d,n). 

The lowest probability comes from sets where none of the r vertices are adjacent, 
hence we bound 

/ ^ \ rm 

PA(ln = = • • • = Iv = 1) > 1 - - , 

y mj 

while the highest probability comes from sets where all the r vertices belong to the 
same 1-dimensional subgraph, hence 

r(r—1) 

/ A \ ^ 

PA(ln = 1,, = • • • = li, = 1) < 1 - - 

\ m J 


For n < r we can find better bounds but we do not mind, since we are interested 
in the asymptotic behavior when n ^ oo and r is fixed. By the transitivity of the 
Hamming graph we bound, using A = dlogn -|- f(l -|- o(l)), 

/ A 


Ea[(W).] > 


— r)\ 

-( 

— r)! 


1 - 

—dr log n—tr{l-\-o{l)) 


79^1 

Since --;-^- — .r, dr 


— r ! 


= n'^^(l — o(l)), we hnd 
EA[(F;)r] > (1 - o(l)) = 


Similarly 


EaKW).] < 




[n^ — r)! 


A 


m 


r(r — l) 


_ ^ • .y^-dr -tr(l+o(l)) A _ A 

[n^ — r)\ \ m 

= g-ir(l+o(l))_ 


r(r—1) 
2 
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This proves that for each r, E[(yn)r] —>■ e so that by Lemma 3.1 the distribution 

of Yn converges to Poi(e“*) when A — dlogn t, so that 

PA(y„ = 0)^e-'^“*. (3.2) 

Furthermore {Hn connected} C = 0}, so we conclude that for A — dlogn —)■ t 

limsup PA(LfA(d, n) is connected) < e“® . 

n—>-oo 

It remains to prove the matching lower bound, i.e., that in the critical window 
Fx{H\{d,n) is disconnected | 14 = 0) —?• 0. 

4. Connectivity conditioned on no isolated vertices 

We prove (2.1) via induction on d. (The standard “tree counting” proof for the 
ERRG given in [5, Section 5.3] is too involved in the presence of geometry.) 

Induction hypothesis. //lim„^oo A — (d — 1) logn = t G M, then 

¥x{Hx{d — l,n) is connected) —)■ e”*^ , 
i.e., (2.1) holds for H{d — l,n). 

We initialize the induction by noting that H{l,n) is a complete graph, so the 
random subgraph Hx{l,n) has the same distribution as an ERRG with p = 

For this case it is proved in [3] that (2.1) holds. 

Definition 4.1 (Hyperplanes). Given H{d,n) = {V,E), define the hyperplanes 
Gjk = {Vjk, Ejk) for some j G [d] and k G [n] as 
t> Vjk = {(R, *2, ■ ■ ■, 4) e R : ij = k); 

O Ejk = {{v,w) e E :v,w e Vjk}. 

Note that H{d,n) has exactly dn hyperplanes and that they are all isomorphic to 
H{d — 1, n). 

We define as the intersection of the Random Edge Subgraph Hx{d,n) with the 
hyperplane Gjk, for each pair j, k. 

The crucial idea of our proof is to show that once we have enough internally 
connected hyperplanes, all the remaining non-isolated vertices are connected to these 
connected hyperplanes with high probability. 

To use this argument, we condition on the event that a certain set of hyperplanes 
is internally connected. To ensure independence under this conditioning, we use 
disjoint edge sets to create the connected hyperplanes and to connect the remaining 
non-isolated vertices to them. 

We define the sets L = {1, 2,..., [n/2j} and R = {\n/2\ Y 1, \n/2\ + 2, ..., n}. 
For each j G [d], we divide V into two sets: 

VlU) '.= {v eV ■. Vj G L] and 


VrU) := {t G R : V, G R}. 


6 


LORENZO FEDERICO, REMCO VAN DER HOFSTAD, AND TIM HULSHOF 


This induces a partition on the edge set E\ 

EllU) ■= {(t,w) e E :v,w e Viij)}, 

ErrU) := {(n,w) e E :v,w e 

ElrU) ■= e E :v e VlU),w G Vr^])}. 

For each hxed j these sets are disjoint, so the occupation status of the edges in one 
set is independent from the occupation status of edges in the other two sets. Note 
that due to the geometry of H[d, n) the exact composition of the sets L and R is 
not relevant, only their size matters. 

For some hxed a (to be determined later) we dehne the events 

ErU) ■= is connected for more than different k E R}, 

ElU) ■= is connected for more than |cm different k E L}. 

We dehne B := H Br^J)). In the hnal steps of the proof, on page 9, we 

will show that F\{B) —)■ 1. 

Note that the event B states that there exist non-parallel internally connected 
hyperplanes, so when B occurs, the geometry of the Hamming graph then ensures 
that all internally connected hyperplanes are in the same connected component, 
deterministically. We will not make explicit use of this fact. Instead, we now prove 
that on B, with high probability, in the critical window Hx{d,n) consists only of 
the giant component and isolated points. 

Proposition 4.2. Let A — dlogn — >■ t G M and d > 2, and let J be the set of all 
isolated points. Then 

lim PA({(e,„ax U J) 7 ^ fl H) = 0. 

n^oo 

Proof. We have to prove that with probability converging to 1 all edges present in 
the graph are connected to the giant component. We know that \E\ = ]^dn^{n — 1). 
We write Z for the number of edges that do not connect to the giant component. If 
Z = 0, then the claim holds, since all points outside the giant component must be 
isolated. We will prove that indeed Ea[Z | H] —)■ 0. 

Choose an edge iy^v') E E and let i E [d] be the unique direction such that 
Vi 7 ^ v[. Choose j E [d] with j ^ i and apply the partition dehned above. Suppose 
that Vj = v'j E L (the argument for Vj = Vj E R is identical). Dehne the event 

E := {v,v' are not connected to any internally connected hyperplane}. 

Since B C Br^J) for all j it follows that 

Pa(f n h) < Pa(f n Br{j)) < Pa(f | Br{j)). 

To bound Pa(F’ | BrIJ)) we explore the graph starting from the vertices v,v', 
with the following algorithm: 
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Step 1: Given the edge {v, v'), set as active the two end vertices: a = v, a' = v'. 
Initialize the set of searched vertices S = 0. 

Step 2: Set S = S U {a, a'}. Check all the edges {ai,w), {a 2 ,w) such that w 
belongs to a connected hyperplane Gjj. with r E R. If they are all vacant, 
then go to Step 3, else stop. 

Step 3: Check all the edges {a,w), {a',w) such that w G \ S. We define 

the set W = {w E Vi^j) \ § : one or both of (a, w) and (a', w) are occupied}. 
If \W\ > 2 then go to Step 4, else stop. 

Step 4: Choose w, w' E W according to an arbitrary but fixed rule and set 
them as the active vertices: a = w and a' = w'. Return to Step 2. 

Activating only two vertices at each cycle of the algorithm allows for some control 
over the depletion of points outside the connected hyperplanes. This means that the 
algorithm can terminate before the starting edge has been connected to the giant 
component or before its connected component has been completely explored. This 
is not a problem: the calculations below show that this algorithm gives a sufficiently 
sharp result to prove the claim. 

Indeed, we want to show that the probability that the exploration process ter¬ 
minates before hnding a connection to one of the internally connected hyperplanes 
among : r G R} has probability tending to zero. That is, we want to show that 
with high probability, the algorithm does not terminate during Step 3. We write T 
for the cycle at which this happens. We set T = cxd if the process hnds the giant 
component, namely if the algorithm terminates during Step 2. 

Note that the algorithm is designed with certain independencies. Indeed, the 
event Bnij) depends only on the edges in Err^J), Step 2 of the exploration only 
depends on edges in Err^J), and Step 3 only depends on edges in Erl^J). 

Let Pg = Pg{k) be the probability of hnding a connection to one of the internally 
connected hyperplanes among : r E R} during the k-th cycle of the exploration 
algorithm, conditioned on the event BR{j) and on the event that the algorithm has 
not yet terminated. We bound 

1 - A < (i - 

for a constant G that depends on t. (This bound does not depend on k because the 
algorithm terminates as soon as the exploration hnds a connected hyperplane, so 
there is no depletion of points inside the connected hyperplanes.) 

Let W denote the number of vertices discovered in Step 3 of the k-ih cycle of the 
exploration and let 

Pk,2 '■= P\{Nk > 2) = 1 — Pa(W = 0) — P\{Nk = 1). 

Each vertex v E VR^j) has (d —l)(n —1)-f [n/2j neighbors in Vr^J), and at time k 
at most 2k of them have already been explored, so W stochastically dominates a 


A 

m 


— = e 


— Aq: 


/‘^(l + o(l)) < Gn-", 
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Bin((2(i — l)(n — 1) — 4/c, A/m) random variable. We bound 


Pa(W e {0,1}) < 1 


m j 


(2d—l){n—l)—Ak 


X 


+ (2d-l)n— 1- 

mV m J 




(2d-l)(n-l)-4fc-l 


So we obtain for some constant c 


1--^ 
m J 


{2d—l)(n—l)—Ak 


1 + 


2d -1 A 

d 1-A 

m 


<cAn“^‘^+^ ( 1 - — 
m 


—4k 


If T = s, then the exploration does not reach a connected hyperplane during the 
hrst s cycles and then the algorithm terminates on Step 3 of the s-th cycle, so we 
can bound 


Pa(T = s I BnU)) < (1 - P., 2 ) n(l - A 


k<s 


A 


-4s 


< C'An-2‘^+^ (1- 

m / 

= CXn-^^+^ ( ( 1 - 

m / 


for some constant C that depends on t. It follows that 


Pa(T < oo I Br{j)) < C'An-^'^+i E [ 


For sufficiently large n we have n"(l — > 1, so the tail of the sum behaves like 

a convergent geometric series, and we bound 

Pa(T < cx) I Bii{j)) < \ogn ~ 

for all d>2. 

Since H{d,n) is transitive and there are |n'^(n — 1) edges, 

Ea[Z I B] < ^n^{n - 1)Pa(T < oo | Br{j)) ^ 0, 
and the claim now follows. □ 



CONNECTIVITY THRESHOLD FOR RANDOM SUBGRAPHS OF THE HAMMING GRAPH 9 


Completion of the proof: induction on the dimension. Recall that the case 
d = 1 initiates the induction, since is an Erdos-Renyi graph, so (2.1) holds. 

For the inductive step we assume that (2.1) holds for Hx{d — l,n), i.e., that for 
all t G M and all sequences A = A(n) such that hm„^oo A — (d — 1) logn = t, we have 

Fx{Hx{d — l,n) is connected) e“® *. 


We want to prove that the same holds for Hx{d,n). 

Given Hx{d,n), its intersection with the hyperplane Gjk has the same dis¬ 
tribution as Hd-iAd — l,n) since p = , and each vertex has (d — l)(n — 1) 

outgoing edges in Gjk- We assumed that lim„^oo A — dlogn = t, which implies that 

, d—1. ,, d — 1 

hm —-—A — (d — 1) logn = —-— t. 

n^oo d d 


Note moreover that is connected} and is connected) are i.i.d. random vari¬ 

ables when k ^ k' under since for hxed j all the subgraphs G^^^ are i.i.d. random 
subgraphs with the same law as Hd-i^(d — l,n). It thus follows by the inductive 

d 

hypothesis that the asymptotic probability that is connected is exp(— 

If we choose e > 0 such that a := exp(—— e > 0, then for each j the 
induction hypothesis and the Weak Law of Large Numbers imply that that 


/K2J 

PA(i?L(jr) = Pa 1 

\ k=l 


{G^f. is connected) — 2 ®^ 


<Pa| 


Ln/2J 


2 ^ _e-id-l)t/d 

/ ^ connected) ^ 

k=l 

The same is true for each Using the union bound. 


> e —)■ 0 as n —)■ cxo. 


P 


a 

-{Bj < (Pi(Bi(i)') + Pa(Br(j)‘)) ^ 0- 


(4.1) 


1 = 1 


Finally, we combine (3.2), (4.1) and Proposition 4.2 to obtain 
Pa(U > 0) < PA(idA(d, n) is disconnected) 

< Pa(U > 0) + Fx{B^) + PA({(e_ U J) ^ [nf} n R), 


completing the proof of the main theorem. 


□ 
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